Phylogenetic diversity within seconds.

نویسندگان

  • Bui Quang Minh
  • Steffen Klaere
  • Arndt von Haeseler
چکیده

We consider a (phylogenetic) tree with n labeled leaves, the taxa, and a length for each branch in the tree. For any subset of k taxa, the phylogenetic diversity is defined as the sum of the branch-lengths of the minimal subtree connecting the taxa in the subset. We introduce two time-efficient algorithms (greedy and pruning) to compute a subset of size k with maximal phylogenetic diversity in O(n log k) and O[n + (n-k) log (n-k)] time, respectively. The greedy algorithm is an efficient implementation of the so-called greedy strategy (Steel, 2005; Pardi and Goldman, 2005), whereas the pruning algorithm provides an alternative description of the same problem. Both algorithms compute within seconds a subtree with maximal phylogenetic diversity for trees with 100,000 taxa or more.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mitochondrial Diversity and Phylogenetic Structure of Marghoz Goat Population

The genetic diversity and phylogenetic structure was analyzed in Marghoz goat population by mitochondrial DNA sequences. Phylogenetic analysis was carried out using hyper variable region 1 (968 bp) obtained form 40 animals. Marghoz goat proved to be extremely diverse (average haplotype diversity of 0.999) and the nucleotide diversity values 0.022. A total of 40 Marghoz goats were grouped into s...

متن کامل

Phylogenetic Diversity within Seconds

—We consider a (phylogenetic) tree with n labeled leaves, the taxa, and a length for each branch in the tree. For any subset of k taxa, the phylogenetic diversity is defined as the sum of the branch-lengths of the minimal subtree connecting the taxa in the subset. We introduce two time-efficient algorithms (greedy and pruning) to compute a subset of size k with maximal phylogenetic diversity in...

متن کامل

Genetic diversity of Arum L. based on plastid marker

TrnL-F region including intron trnL (UAA) and trnL (UAA) - trn (GAA) spacer in the large single-copy region of the chloroplast genome is widely used to infer phylogenetic relationships in plants. In this study, we obtained the trnL-F sequences from 8 samples of Arum L. in Iran. Phylogenetic analyses were conducted by the Bayesian inference, maximum parsimony, and maximum likelihood methods. The...

متن کامل

Phylogeny and genetic diversity of Fusarium graminearum species complex associated with Fusarium head blight of wheat in Moghan plain (Iran)

Thirty-seven isolates of Fusarium graminearum species complexobtained from wheat heads with Fusarium head blight symptoms were selected and used for phylogenetic studies. They were collected from different localities of Moghan plain (Ardebil province, Iran). Partial sequences of translation elongation factor 1-alpha (TEF), putative reductase (RED) and UTP-ammonia ligase (URA) genes were amplifi...

متن کامل

The Major Sources of Genetic Differentiation Among Apricot Latent Virus (ApLV) Isolates

Background and Aims: Apricot latent virus (ApLV) is a species within Foveavirus genus (Betaflexiviridae family, Tymovirales order). Phylogenetic analyses using different ORFs nucleotide sequences divided most ApLV isolates into two clusters. However, there is little data about the sources of genetic differentiation among ApLV isolates. Materials and Methods: Partial coat protein (CP) sequences...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Systematic biology

دوره 55 5  شماره 

صفحات  -

تاریخ انتشار 2006